Combinatorial Entropy Encoding
نویسنده
چکیده
This paper proposes a novel entropy encoding technique for lossless data compression. Representing a message string by its lexicographic index in the permutations of its symbols results in a compressed version matching Shannon entropy of the message. Commercial data compression standards make use of Huffman or arithmetic coding at some stage of the compression process. In the proposed method, like arithmetic coding entire string is mapped to an integer but is not based on fractional numbers. Unlike both arithmetic and Huffman coding no prior entropy model of the source is required. Simple intuitive algorithm based on multinomial coefficients is developed for entropy encoding that adoptively uses low number of bits for more frequent symbols. Correctness of the algorithm is demonstrated by an example.
منابع مشابه
From Clustering Supersequences to Entropy Minimizing Subsequences for Single and Double Deletions
A binary string transmitted via a memoryless i.i.d. deletion channel is received as a subsequence of the original input. From this, one obtains a posterior distribution on the channel input, corresponding to a set of candidate supersequences weighted by the number of times the received subsequence can be embedded in them. In a previous work it is conjectured on the basis of experimental data th...
متن کاملOn strong correlations and error-correction in the neural code of the retina
As they respond to naturalistic stimuli, ganglion cells in the vertebrate retina exhibit weak pairwise correlations, but the combinatorial patterns of spiking and silence from larger groups of cells occur with frequencies that differ by many orders of magnitude from the predictions of a model in which each cell acts independently. The central result of [1] is that these seemingly contradictory ...
متن کاملCombinatorial Information Theory: I. Philosophical Basis of Cross-Entropy and Entropy
This study critically analyses the information-theoretic, axiomatic and combinatorial philosophical bases of the entropy and cross-entropy concepts. The combinatorial basis is shown to be the most fundamental (most primitive) of these three bases, since it gives (i) a derivation for the Kullback-Leibler cross-entropy and Shannon entropy functions, as simplified forms of the multinomial distribu...
متن کاملar X iv : c on d - m at / 0 51 20 17 v 2 9 J an 2 00 6 Combinatorial Information Theory : I . Philosophical Basis of Cross - Entropy and Entropy
The three main theoretical bases of the concepts of entropy and cross-entropy informationtheoretic, axiomatic and combinatorial are critically examined. It is shown that the combinatorial basis, proposed by Boltzmann and Planck, is the most fundamental (most primitive) basis of these concepts, since it provides (i) a derivation of the Kullback-Leibler cross-entropy and Shannon entropy functions...
متن کاملBinary Combinatorial Coding
We present a novel binary entropy code called combinatorial coding (CC). The theoretical basis for CC has been described previously under the context of universal coding [1], enumerative coding [2], and minimum description length [3]. The code described in these references works as follows: assume the source data of length M is binary, memoryless, and generated with an unknown parameter θ, the ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- CoRR
دوره abs/1703.08127 شماره
صفحات -
تاریخ انتشار 2017